Robust fundamental frequency estimation using instantaneous frequencies of harmonic components

نویسندگان

  • Yoshinori Atake
  • Toshio Irino
  • Hideki Kawahara
  • Jinlin Lu
  • Satoshi Nakamura
  • Kiyohiro Shikano
چکیده

This paper proposes a noise-tolerant method for fundamental frequency (F0) extraction. This method includes several new ideas, including the estimation of the instantaneous frequencies of the higher harmonic components, and the design of an adaptive weighting function based on a bandwidth equation that combines the F0 information in the harmonic components. To evaluate the proposed method, we constructed a relatively large database of simultaneous recordings of speech waveforms and EGG (Electro Glotto Graphy). The database consists of 30 sentences pronounced by 14 male and 14 female normal subjects, i.e., 840 sentences in total. The duration of the sound is about 35 minutes including about 20 minutes of voicing. The experiments were performed with additive noise for four pitch extraction methods, i.e., the proposed method, the original TEMPO, an improved cepstrum method, and a common F0 extraction program in ESPS. The results were as follows: 1) the proposed method is always better than any of the other methods when the SNR is greater than about 2 dB; 2) for high SNR values (> 15 dB), the correct rates of the proposed method and the original TEMPO are about 95% and much better than the improved cepstrum method (92%) and the ESPS function (89%); and 3) all of the methods degrade to less than 62% when the SNR is 0 dB. As a result, the proposed method improves the performance for low SNR values and also maintains high accuracy inherent from the original TEMPO for high SNR values.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust and accurate fundamental frequency estimation based on dominant harmonic components.

This paper presents a new method for robust and accurate fundamental frequency (F0) estimation in the presence of background noise and spectral distortion. Degree of dominance and dominance spectrum are defined based on instantaneous frequencies. The degree of dominance allows one to evaluate the magnitude of individual harmonic components of the speech signals relative to background noise whil...

متن کامل

Robust fundamental frequency estimation against background noise and spectral distortion

This paper presents a new method for robust fundamental frequency (F0) estimation in the presence of background noise and spectral distortion. We define degree of dominance and a dominance spectrum based on instantaneous frequencies. The degree of dominance allows us to evaluate the magnitude of individual harmonic components of speech signals relative to background noise while eliminating the ...

متن کامل

Robust Fundamental Frequ against Background Noise And

This paper presents a new method for robust fundamental frequency (F0) estimation in the presence of background noise and spectral distortion. We define degree of dominance and a dominance spectrum based on instantaneous frequencies. The degree of dominance allows us to evaluate the magnitude of individual harmonic components of speech signals relative to background noise while eliminating the ...

متن کامل

On a robust F0 estimation of speech based on IRAPT using robust TV-CAR analysis

Fundamental frequency (F0) estimation is important in speech processing such as speech coding, synthesis, recognition and so on. A present F0 estimation method performs well under clean condition, however the performance deteriorates significantly in noisy environment. As a result, robust F0 estimation against additive noise is demanded. We have previously proposed F0 estimation methods based o...

متن کامل

Dominance spectrum based V/UV classification and estimation

This paper presents a new method for robust voiced/unvoiced segment (V/UV) classification and accurate fundamental frequency ( ) estimation in a noisy environment. For this purpose, we introduce the degree of dominance and dominance spectrum that are defined by instantaneous frequency. The degree of dominance allows us to evaluate the magnitude of individual harmonic components of speech signal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000